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SEARCH ENGINE USER INTERFACE 

The present invention generally relates to a search 
engine user interface which' allows a user to enter a 
5 query and view representations for data which most 
closely match the query. In particular, the present 
invention relates to an interface which allows a user to 
view the data which best matches the query as a two- 
dimensional arrangement of representations spaced in 

10 accordance with the similarity in the data. 

In computer systems where a large amount of data is 
stored, there is a problem in providing a simple and 
accurate method of searching for and retrieving data . 
Data can be stored in many different formats such as text 

15 e.g. documents, audio data e.g. music, image data e.g. 
pictures , and video data e.g. movies . Also , with the 
rapid growth in the worldwide web, web search engines 
have been developed. These form data bases of web pages 
which are searchable using key words. 

2 0 For data which contains text, the data can be 

searched using text queries e.g. key word searches or 
natural language queries. Such search techniques have 
been used for many years. 

Work has been carried out and published in a paper 

25 by R Beale ("Foreign Interactions" Interfaces 37, pages 
23 to 26, 1998) with regard to document retrieval. In 
the system disclosed in the Beale paper a text query 
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results in a three dimensional display likened to a 
"dandelion head", A central node represents the query 
and outlying spheres represent documents returned by that 
query. The size of the node represents the document size 
5 and closer the node is to the centre the higher the 
degree of matching between document and the query. The 
document can be selected by clicking on a node. If a 
second query is entered, a second "dandelion head" is 
generated with any common node being linked. Although 

10 this method provides more information than is available 
by simply listing the documents, it does not show the 
relationship between the documents returned as a result 
of the search. Also, navigation of the three-dimensional 
space is limited. 

15 Another document search technique is disclosed in an 

article by M Zizi entitled "Interactive Dynamic Maps for 
Visualisation and Retrieval from Hypertext Systems" (in 
"Information Retrieval and Hypertext", M. Agosti and A 
Smeaton (eds), Kluwer Academic, 1996). In this system 

20 documents are displayed in positions on a map in a lay- 
out according to relationships between the documents . A 
user can then visualise the similarity between documents 
and select and retrieve the desired document. 

For non-text data, it is known that meta data can be 

2 5 provided for the data. Such meta data can include text 
e.g. a description of the content of the data file to 
which the meta data is associated. Such meta data is 
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thus searchable using conventional key word searches or 
natural language searches. Such a technique is disclosed 
in US 5493677. 

These techniques for searching non-text data 
5 generate search results which usually simply consist of 
either outputting the best match or a list of matches in 
order of matching. This does not help a user to 
understand the relationship between the query and the 
data returned in the search results, nor does it provide 

10 the information regarding the relationships between the 
data returned. 

Another method of searching non-text data is to use 
non-text queries. For example, audio data can be 
searched using a particular audio signature. In a paper 

15 by Y Rubner et al entitled "Adaptive Colour- Image 
Embeddings for Database Navigation" (Proceedings 1998 
I.E. Asian Conference on Computer Vision, Hong Kong), a 
system is disclosed which allows for the searching of 
image data using a colour query. The input colour query 

20 is used to determine the closest matches to ''the query and 
thumbnail images are then displayed on a two-dimensional 
map where the images are arranged in dependence upon 
their colour similarities. A user is able to navigate 
through the database of colour images by selecting colour 

2 5 images of interest. A plurality of selected colour 
images are averaged to form the next query and the map is 
redisplayed centred on the query. Although this method 
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provides a map for a user which is navigable, the 
information it provides is of very limited use. 

It is one object of the present invention to provide 
a user interface to a search engine of a database which 
5 allows the user to obtain a better understanding of the 
relationship between the query and the returned data and 
the relationship between the data which has been returned 
itself . 

This object of the present invention is achieved by 

10 a database interface and method, in which the database 
has a plurality of sets of non-text data. Each set of 
non-text data has a descriptive text caption associated 
with it. An input text query is received and compared 
with the captions to determine the similarities between 

15 the input query and the captions. A display means is 
controlled to display representations of a plurality of 
sets of non-text data which have captions which are the 
most similar to the input query and which are separated 
in accordance with similarities. 

20 This object of the present invention can also be 

achieved by database interface to a database of a 
plurality of sets of non-text data where each set of non- 
text data in the database has a descriptive text caption 
associated with it. An input textual query is received 

2 5 and compared with the captions for the sets of non-text 
data to determine the similarities between the input 
query and the captions. The most similar set of non-text 



data is then selected and display means is controlled to 
display a representation of the selected set of non-text 
data and representations of a plurality of sets of non- 
text data having the most similar captions to the caption 
of the selected set of non-text data. The 
representations are displayed separated in accordance 
with the similarities. 

Thus in accordance with this aspect of the present 
invention the user is able to identify relationships 
between sets of data such as images or audio files , based 
on the arrangement of the representations of the sets of 
data. The arrangement is based on the textual caption 
and not the actual content of the sets of data. Thus the 
textual caption is able to describe the content of the 
set of data to which it is associated thus facilitating 
a more user friendly database interface. This is 
particularly so when natural language is used and in an 
embodiment the comparison is carried out by comparing the 
meaning of the captions and the query. 

In an embodiment of the present invention the 
representation for the set of data having the most 
similar caption to the query is arranged in a central 
region of the display. The other representations are 
then arranged around it. This focuses the user on the 
best match and allows a user to see that the poorer 
matches lie around the periphery. 



The representations of the sets of image data can 
comprise any form of representation. For instance, for 
image data, the representation can comprise a thumbnail 
image. For video data, the representation can comprise 
a reduced video frame. For other non-text data, the 
representation can comprise an icon specifically 
generated to represent the data . Any form of 

representation can be used which will help the user 
understand the content of the set of data. 

It is an object of another aspect of the present 
invention to allow a user to select representations and 
thus navigate through the database with ease. 

This object of the present invention is achieved by 
providing an apparatus for accessing a database of sets 
of data. Each set of data has similarity data giving the 
similarity of the data to other sets of data and the 
identity of the other sets of data. A query is received 
and compared with the sets of data or meta data for the 
sets of data to determine at least the most similar set 
of data. A representation for the most similar set of 
data is displayed together with representations for the 
other sets of data arranged in accordance with their 
similarity. The user is able to select a representation 
and the similarity data for the set of data corresponding 
to the selected representation is retrieved. In response 
to the retrieval of the similarity data, the selected 
representation and the other sets of data identified in 
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the similarity data are displayed arranged in accordance 
with their similarities. 

Thus this aspect of the present invention, by 
providing prestored similarity data, avoids the need for 
5 the recalculation of similarity values when an image is 
selected. When an image is selected it becomes 
effectively a query. Because this is a fixed parameter 
already known by the data base, the similarity between 
each set of data and the most similar sets of data can be 

10 stored as the similarity data. This avoids the need for 
a recalculation of similarity values every time a 
representation is selected as a new query. Thus the 
system is able to respond more rapidly to the users 
selection to refresh the display. 

15 When the user makes the initial query, in one 

embodiment only the most similar set of data is 
determined. The similarity data for the most similar set 
of data is then retrieved and the display means is 
controlled to display a representation of the most 

20 similar set of data and representations of the other sets 
of data identified in the similarity data. 

In an alternative embodiment a plurality of the most 
similar sets of data to the query are determined. The 
display means is then controlled to display 

25 representations of the most similar sets of data arranged 
in accordance with the similarity values- 
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Another aspect of the present invention provides a 
method and apparatus for displaying the relationships 
between sets of data. Sets of data and similarity values 
for the similarity between the sets of data are received. 
5 Display means is then controlled to display 
representations for the sets of data separated in 
accordance with said similarity values. Links are 
displayed in a first style between the representations 
which correspond to sets of data having a strong 

10 similarity value. Links are displayed in a second style 
between the representations which correspond to sets of 
data having a weak similarity. 

Thus this aspect of the present invention provides 
further information to a user. The user is able to 

15 identify easily those sets of data which are similar and 
dissimilar. The links which are shown in the first style 
tend to act as links which pull the representations 
together on the display. The links which are displayed 
in the second style will tend to repell the 

20 representations . 

In a further aspect of the present invention a data 
display apparatus and method displays the relationships 
between sets of data to allow the animated display of the 
arrangement. Sets of data and similarity values for the 

25 similarity between the sets of data is received. The 
optimum separations of the displayed representations of 
the sets of data are iteratively calculated from starting 



separations to target separations corresponding to the 
similarity values. Display means are then controlled to 
display the representations arranged in accordance with 
the iterative calculations at time sequential stages of 
the iterative calculations. In this way a user is able 
to see the representations moving from initial positions 
to optimum positions in order to try to achieve the 
target separations . Thus a user is able to see which 
representations tend to move towards and away from one 
another thus enhancing the visual understanding of the 
relationship between the sets of data. 

Embodiments of the present invention will now be 
described with reference to the accompanying drawings, in 
which: - 

Figure 1 is a diagram of the interface displayed to 
a user of an embodiment of the present invention 
following the input of a query "red toy car"; 

Figure 2 is a diagram of the interface displayed to 
a user following the selection of an image of Figure 1 as 
a query; 

Figure 3 is a schematic functional diagram of an 
embodiment of the present invention; 

Figure 4 is a schematic diagram of an embodiment of 
the present invention; 

Figure 5 is a flow diagram showing the overall 
process of an embodiment of the present invention; 
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Figure 6 is a flow diagram illustrating the method 
of calculating the similarity matrices off line; 

Figure 7 is a flow diagram illustrating the method 
of calculating map data for the similarity matrixes off 
5 line; 

Figure 8 is a flow diagram illustrating the method 
of rendering the map; 

Figure 9 is a flow diagram illustrating in more 
detail the steps carried out in step S45 of the flow 
10 diagram of Figure 8; 

Figure 10 is a schematic functional diagram of a 
second embodiment of the present invention; 

Figure 11 is a flow diagram of the overall operation 
of the second embodiment of the present invention; and 
15 Figure 12 is a flow diagram illustrating the method 

of calculating map data on line. 

Two embodiments of the present invention will now be 
described. Both of these embodiments comprise an 

20 interface for a search engine for searching for images 
within an image database. Searching is performed using 
natural language queries which can comprise simple key 
words or syntactically structured natural language. Each 
of the images has associated with it meta data in the 

25 form of captions which describe the subject matter of the 
image. Such captions are manually entered during the 
formation of the database. 
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FIRST EMBODIMENT 

The first embodiment of the present invention will 
now be described with reference to Figures 1 to 9, 

Referring to Figure 1, this diagram illustrates the 
5 interface to a user. A user operates a computer with a 
web browser such as Microsoft Explorer™. Within the web 
browser r a particular HTML file is opened which invokes 
a Java applet. The Java applet configures the display to 
include a query input window 100 with a submit button 

10 101. This enables a user to type in a query and by 
clicking on the submit button 101 the query can be sent 
to a search engine as a search query. The query window 
100 is of course initially empty and in Figure 1 the 
query "red toy car" has been entered. Below the query 

15 window 100 three buttons are provided. A refresh button 
102 is provided for causing the re-rendering of the map 
data as will be described in more detail hereinafter. An 
options button 103 is provided to allow for the 
configuration of various parameters as will be described 

2 0 in more detail hereinafter. Also, as is conventional, a 
help button 104 is provided to allow a user to access 
information to help them use the interface. These 
buttons can be selected as is conventional within a web 
browser by moving a pointer using for example a mouse and 

25 clicking. Next to the buttons there is provided status 
information in a status display region 105. The status 
information that can be provided comprises: 



1. "Ready" indicating that the interface is awaiting 
an input from the user. 

2. "Downloading map data" indicating that the 
interface is clearing data for forming the map". 

3. "Organising images" indicating that the interface 
is rendering the map. 

4. "Sending query" indicating that the interface is 
sending a query to search engine. 

Underneath the buttons 102, 103, 104 and the status 
information display region 105 there is provided a region 
106 in which is displayed a caption for an image in the 
image data base to be searched. Initially this is empty 
but once a map has been formed, it contains the caption 
of the best match to the query hereinafter termed a 
"focal image". 

Below the caption section 106 of the display, there 
is provided a map window 107 which comprises the bulk of 
the display. It is within this window that the search 
results from the search engine are mapped. Initially of 
course, the map window 107 is empty. As can be seen in 
Figure 1, once a query has been entered map data in the 
form of thumbnail images is formed. It can be seen in 
Figure 1 that as a result of the query "red toy car" an 
image 108 is highlighted the centre of the map window 
107. This is a thumbnail image of the image data which 
most closely matches the input query. The closest match 
therefore becomes the focus of the map and the next 
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closest matching images are used to form thumbnail images 
about the focal image 108. The proximity of the thumbnail 
images to the focal image depends on the similarity 
between the thumbnail images and the focal image. It can 
5 clearly be seen in Figure 1 that cars are clustered 
around the focal image 108. The images are allowed to 
overlap. Between the images links can be shown. For 
example , between the focal image 108 and the image 109 a 
link 110 is illustrated. This helps to illustrate the 

10 relationship between the images. Links can also be shown 
between the images surrounding the focal image 108. 

The arrangement of the thumbnail images within the 
map window 107 is dependent upon the similarity between 
them based on their captions. The similarity measure in 

15 this embodiment is determined off line and is contained 
within a database. The input can be matched to the 
captions in the database to determine the best match. 
Once the best match is found, predetermined similarities 
between the best match and other entries in the database 

2 0 are used. This will be described in more detail 
hereinafter. 

Also within the map window 107 there is provided a 
thumbnail image 111 which acts as a "back button" to 
allow the user to return to a previous map following a 
2 5 navigation around the map as will be described next. 

A user is able to navigate around the image database 
by selecting a thumbnail image using a pointer and 



clicking on the thumbnail. When a user selects the 
thumbnail image 112 in Figure 1, the selected thumbnail 
image becomes the focal image and the most similar images 
to the focal image are then displayed around the new 
focal image. As can be seen in Figure 2, the five 
thumbnail images of cars which includes the original 
focal image 108 remain as does image 109. The remaining 
images are new. Once again the focal image 112 is 
displayed highlighted. Also, the caption window 10 6 now 
displays the caption for the new focal image. 

The functionality of this embodiment of the present 
invention will now be described with reference to 
Figure 3 which illustrates the functional components of 
this embodiment. 

A user interface 1 is provided which comprises a 
pointing device 2 such as a mouse, an input device 4 such 
as a keyboard and a display device 3. When a user inputs 
a query using the query input device 4, this is sent to 
the on-line similarity calculator 9. This will access 
meta data in the meta data database 6 via the storage 
device control 5 in order to identify the best match 
between the query and the meta data entries for the data. 
Once a best match has been found, an image is identified 
using corresponding map data stored in the map data 
database 7 and corresponding data from the database 8 is 
retrieved and passed to the rendering processor 10 for 



rendering to form the map in the map window 107 on the 
display device 3. 

In order to form the. map data in the database 7, an 
off line process has to take place prior to the use of 
the system. An off line similarity calculator 11 
accesses the meta data in the database 6 in order to 
determine the similarities between the meta data entries. 
These similarity values are then used to form map data 
using the map data calculator 12 . The calculated map 
data is then stored in the map data database 7. 

Figure 4 is a schematic diagram of an implementation 
of the present invention. This embodiment comprises a 
server and client linked over a network 28. Because in 
this embodiment a web browser is used, the network can 
comprise any form of network such as a local area network 
or the Internet. The client comprises a bus 20 linking 
the components of the client system together. A display 
21 is provided for providing a display output of the form 
as shown in Figures 1 and 2. A processor 22 implements 
code stored in a program storage device 24. Web browser 
code within the program storage device 2 4 enables the 
processor 32 to implement a web browser 32a. HTML and 
Java code is stored in the program storage device 2 4 and 
enables the web browser 22a to launch a Java applet 22b. 
A pointing device 23 such as a mouse is provided to allow 
a user to select displayed items. A keyboard 2 6 is 
provided to allow the user to enter a query. Data 
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storage 25 contains map data and image data to enable the 
map in the map window 10 7 of the display to be rendered. 
An interface 27 is provided for interfacing to the 
network 28 to enable the client to access the server. 
5 The server is provided with a bus 30 linking 

components together. A network interface 29 is provided 
to link the server to the network 28. A processor 37 is 
provided to implement programs stored in the program 
storage device 34. Web server code stored in the program 

10 storage device 34 is loaded by the processor 31 in order 
to implement a web server 31a. A database application 
program is loaded by the processor 31 in order to 
implement a database application 31c. A map data 
calculation program stored in the program storage device 

15 34 is loaded by the processor 31 in order to implement a 
map data calculation application 31b. A matching program 
is loaded by the processor 31 in order to implement a 
matching application 31b. 

The server is also provided with a storage device 32 

20 comprising an image database 32a, a caption database 32b 
and a map data database 32c. These databases store 
respective image captions and map data. In order to 
generate the caption database 32b, a server is also 
provided with an caption input device 33 which can for 

25 example comprise a keyboard to allow an operator to 
manually enter the captions for the images. 
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As can be seen from Figure 4, this embodiment of the 
present invention can be implemented over the network 
wherein the client operates a single Java application. 
The bulk of the processing is carried out at the server. 
5 The operation of this embodiment of the present 

invention will now be described in detail with reference 
to the flow diagram of Figures 5 to 9* 

Figure 5 is a flow diagram illustrating the overall 
operation of this embodiment of the present invention. In 
10 step SI an operator creates captions for the image data 
base. The captions are then stored in the captions 
database in step S2. A caption can comprise a 
description of the subject matter of the images to enable 
natural language queries to be used for searching the 
15 image database. 

In step S3 the similarity between the captions (and 
thus the images) is determined in order to generate 
similarity matrices. A similarity matrix is generated 
for each image and comprises similarity values for only 
20 a number of closest or best match images. Thus, 
similarities need not be kept for similarities between 
all images. The details of the generation of the 
similarity matrices will be described hereinafter with 
reference to Figure 6- 
25 Using the similarity matrices map data is generated 

for each image in the image database in step S4. This is 
then stored in the map data database in step S5 . Details 
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of the generation of the map data will be described 
hereinafter in more detail with reference to Figure 7 . 

Steps SI to S5 comprise processing steps which take 
place before a user enters a query. In other words the 
image database , caption database, and map data databases 
are formed off-line ready for a user to enter queries. 

The on-line process following the input of a query 
by a user will now be described. 

In step S6 a user inputs a query such as illustrated 
in Figure 1 by typing in for example "red toy car". This 
is submitted by using the submit button 101. Using the 
on-line similarity calculator 9 the best match between 
the input query and the captions are determined in order 
to identify the corresponding map ID in step S7. Using 
the determined map ID, in step S8 map data is retrieved 
from the map data database. In step S9 the map is 
rendered using the map data and by retrieving image data 
referenced in the map data. In step S10 the process will 
then await a mouse input. When a mouse input is received 
in step Sll it is determined whether this input has 
occurred within the map window. If there was a map input 
within the map window, there are three possible inputs: 
double click, click and drag. If a double click event 
occurs, in step S12 the map ID of the selected thumbnail 
is obtained and the process returns to step S8 whereby 
the map ID is used to retrieve the map data. This 
process is illustrated in the transition from the 
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displays of Figures 1 and 2. In the display of Figure 1 
the user has selected the image 112 by double clicking 
which results in the display of Figure 2 wherein the 
image 112 becomes the focal image. 
5 If the mouse event in the map window is a click 

event, in step S13 the selected image is displayed at an 
enlarged size to enable a user to more easily view it. 
In order to avoid cluttering the map window 107 , only one 
image can be selected to be displayed enlarged at a time. 

10 If another image is selected whilst an image is displayed 
enlarged, the previously selected image is returned to 
its thumbnail size and the newly selected image is 
displayed at an enlarged size. 

If the mouse input in the map window comprises a 

15 drag event, a thumbnail image which had been selected to 
be dragged is moved within the map window. Any links 
between the thumbnail images are maintained and move as 
necessary during the movement of the selected image in 
step S14. This ability to allow a user to move images 

20 enables a user to manually rearrange the thumbnail images 
either to allow the rendering process to be repeated 
using the refresh button 102, or merely to pull out an 
image which is partially covered by other images so that 
it can viewed. For instance, a user may wish to move the 

25 thumbnail image 108 illustrated in Figure 2 which is 
partly obscured in order to be able to see the full 
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thumbnail image. This may be the image that the user is 
seeking in the search. 

In step Sll if the .mouse input is not in the map 
window, in step S15 it is determined whether the mouse 
5 input comprises the selection of a button. If not, in 
step S16 the mouse input is ignored. If a button is 
selected, if the refresh button is selected the process 
then returns to step S9 to re-render the map. The 
re-rendering process may be desirable for several 

10 reasons. As will be described hereinafter, the rendering 
process includes the calculation of the separation of the 
images for a limited period of time. In other words, the 
algorithm which determines the spacing of the images 
iterates towards the solution but is only allowed a 

15 certain time to do it. A user may wish to allow the 
iterations to continue for a longer and thus the refresh 
button 102 can be selected for this purpose. Also, if a 
user has manually moved a thumbnail using the process of 
step S14, this may help in clarifying the rendering 

20 process. Because the arrangement of the thumbnail images 
from the map window 107 is determined by an algorithm 
which has to take into consideration competing forces 
between images , the map displayed may not be the only 
"solution". Thus because it is possible to move a 

25 selected thumbnail and re-render, a more clear map may 
result . 



If the button selected in step S15 is determined to 
be the help button, in step S17 the help menu is 
displayed. The help menu includes information to help a 
user to use the software. 

If in step S15 it is determined that the options 
button has been selected, in step S18 the options menu is 
displayed. The options menu allows various 

configurations of options of the system to be set. The 
linkage strategy can be set so that either: 

1. Radial links are set i.e. the links from the focal 
image to each of the other images, 

2. All of the links are shown, or 

3. None of the links are shown. 

Using the options menu the number of thumbnail 
images rendered in the map can also be selected. As 
mentioned hereinabove, the similarity matrices do not 
contain similarity values for all images . In the present 
embodiment the number of thumbnails N is set to 10. Thus, 
each matrix is a 10 by 10 matrix wherein for an image 
similarity values for the 9 best matches are included. 
The options menu allows the number of thumbnails to be 
reduced to a number less than N. This means that images 
with the lowest similarity value to the focal image will 
not be displayed. It is not possible to show a number of 
thumbnail images greater than the number in the 
similarity matrices because the similarity values are not 
available . 
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Another options setting which is available is to 
allow the web browser to preload images. Thus when this 
is selected, instead of the map being formed wherein the 
images are gradually displayed as image data is 
5 retrieved, no map data is illustrated until all the 
images are available. This causes a delay but ensures 
that all the images are available for the rendering 
process before the rendering process begins. 

In step S15 of Figure 5 if the button selected is 

10 the submit button, this means that a new query has been 
input and thus the process returns to step S6. 

As can be seen from Figure 5, the embodiment of the 
present invention allows a user to manually input a 
natural language query, or effectively enter a query by 

15 a selecting an image. A method of entry of a query by 
selection of an image enables a user to navigate the 
image database. If during navigation the original image 
is not included in the closest matches surrounding the 
new focal image, a user is able to return to the previous 

20 map data using the back button 111 which is the image of 
the previous focal image. 

In this embodiment the generation of the similarity 
matrices takes place off line by determining the 
similarity between the entries in the database- This has 

25 the benefit of reducing the processing time required. It 
is only necessary simply to identify the best match 
between the input query and the captions in the captions 
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database. However, this suffers from the disadvantage 
that the best matches illustrated in the map are the best 
matches to the focal image and not the best matches to 
the query. A benefit of this however is that the user is 
5 given an impression of the pattern of data in the 
database. 

The method of generating the similarity matrices 
(step S3 in Figure 5) will now be described with 
reference to Figure 6. 

10 In step S20 an initial caption is selected from the 

caption database. In step S21 the best N-l best matches 
are found. The matching process in this embodiment can 
comprise a simple keyword matching but preferably 
comprise a full natural language phrase matching 

15 technique as disclosed in UK Patent Application No. 
9904662 and UK Patent Application No. 9821969 the 
contents of which are hereby incorporated by reference. 
The best matches are found by comparing similarity values 
returned from the matching process. Thus the result of 

20 step S21 is a series of values which will be used to form 
the radial links in the map data. 

The process then goes on in step S22 to calculate 
the values for the similarity of each of the other 
captions with each other. The purpose of this is 

25 generate a full N x N matrix so that not only can radial 
links be determined in the rendering of the map, but also 
links between outlying images. In step S23 the values 
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are output of an N x N matrix indexed by the image ID for 
the selected caption. In the matrix the rows and columns 
are indexed by image IDs' for the best matches. In step 
S24 the next caption is selected and the process returns 
5 to step S21. The process in Figure 6 thus repeats for 
each of the captions in the caption database to form the 
similarity matrices. 
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Table 1 below illustrates the similarity matrix for 
the focal image of Figure 1. 

TABLE 1 





17/71 


17/72 


17/73 


5/62 


6/35 


6/36 


6/37 


6/38 


6/39 


6/40 


17/71 


1.00 


0.51 


0.67 


0.00 


0.67 


0.36 


0.42 


0.34 


0.65 


0.40 


17/72 


0.51 


1.00 


0.49 


0.00 


0.64 


0.36 


0.38 


0.32 


0.62 


0.38 


17/73 


0.67 


0.49 


1.00 


0.00 


0.55 


0.31 


0.33 


0.27 


0.53 


0.32 


5/62 


0.00 


0.00 


0.00 


1.00 


0.02 


0.28 


0.03 


0.03 


0.00 


0.09 


6/35 


0.67 


0.64 


0.55 


0.02 


1.00 


0.59 


0.63 


0.52 


0.93 


0.61 


6/36 


0.36 


0.36 


0.31 


0.28 


0.59 


1.00 


0.36 


0,64 


0.53 


0.38 


6/37 


0.42 


0.38 


0.33 


0.03 


0.63 


0.36 


1.00 


0.34 


0.56 


0.40 


6/38 


0.34 


0.32 


0.27 


0.03 


0.52 


0.64 


0.34 


1.00 


0.47 


0,36 


6/39 


0.65 


0.62 


0.53 


0.00 


0.93 


0.53 


0.56 


0.47 


1.00 


0.56 


6/43 


0.40 


0.38 


0.32 


0.09 


0.61 


0.38 


0.40 


0.36 


0.56 


1.00 



Table 2 below illustrates the similarity matrix for 
the focal image of Figure 2 - 
20 TABLE 2 





16/70 


17/69 


17/70 


17/71 


17/72 


17/74 


6/35 


6/39 


6/43 


6/44 


16/70 


1.00 


0.00 


0.09 


0.00 


0.00 


0.24 


0.00 


0.00 


0.22 


0.41 


17/69 


0.00 


1.00 


0.19 


0.03 


0.00 


0.29 


0.00 


0.00 


0.26 


0.37 


17/70 


0.09 


0.19 


1.00 


0.23 


0.27 


0.12 


0.26 


0.25 


0.27 


0.08 


17/71 


O.OO 


0.03 


0.23 


1.00 


0.51 


0.36 


0.67 


0.65 


0.20 


0.00 


17/72 


0.00 


0.00 


0.27 


0.51 


1.00 


0.33 


0.64 


0.62 


0.20 


0.00 


17/74 


0.24 


0.29 


0.12 


0.36 


0.33 


1.00 


0.27 


0.26 


0.42 


0.46 


6/35 


0.00 


0.00 


0.26 


0.67 


0.64 


0.27 


1.00 


0.93 


0.25 


0.05 


6/39 


0.00 


0.00 


0.25 


0.65 


0.62 


0.26 


0.93 


1.00 


0.22 


0.00 


6/43 


0.22 


0.26 


0.27 


0.20 


0.20 


0.42 


0.25 


0,22 


1.00 


0.46 


6/44 


0.41 


0.37 


0.08 


0.00 


0.00 


0.46 


0.05 


0.00 


0.46 


1.00 
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In the similarity matrices given in the tables 
above, the images are identified by an image ID of the 
format "directory/image number". Thus the first column 
in Table 1 is for similarity values for the 10 images 
5 similar to image number 71 in directory 17. 

It can be seen that the diagonal window of the 
matrix has values of 1. Below the diagonal line the 
values will repeat what appears above. Thus only the 
upper or lower part of the matrix need be stored for the 

10 present technique wherein the similarity measure between 
images is symmetric i.e. it does not matter which image 
you take as the reference in the comparison e.g. 
comparing image 17/71 with image 17/72 is equivalent to 
comparing image 17/72 with image 17/71. 

15 The process of calculating the map data from each of 

the similarity matrices will now be described with 
reference to Figure 7. 

In step S30 an initial matrix is selected. In step 
S31 captions for each image ID used to index the matrix 

2 0 is output to a map data file indexed by the map ID. The 
mean, standard deviation, maximum and minimum values in 
the matrix are then calculated in step S32. In order to 
reduce the number of links which are used in the map 
data, only links which are significant are kept. 

25 Significant links are determined in step S33 by 
identifying values in the matrix that are either indexed 
by image ID for the selected matrix, or values which 
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differ by more than n standard deviations from the mean, 
where n = 1.25 in this embodiment. Thus all of the radial 
links are identified together with links which indicate 
a strong similarity or a strong dissimilarity. In step 
5 S34 the identified values are then normalised to values 
which are suitable for displaying as links i.e. 
separations between the images. The normalised values 
are then output to the map data file and indexed by the 
caption sequence i.e. if a link value is output for a 
10 link between the first and fourth image in the sequence, 
the output format is " 1-4/iength" , where "length" is the 
link length. 

In step S3 6 the image ID for each of the images in 
the caption sequence is then output to the map data 

15 file. Then in step S37 the caption sequence index number 
of the image ID for the selected matrix is output as an 
indication of the focal image in order to complete the 
data in the map data file. In step S3 8 the next matrix 
is then selected and the process returns to step S31. 

20 The map data file for the image 6/3 6 i.e. the focal 

image of Figure 1 is indicated below: 

red watermelon flesh with black seeds sunk into the 
surface. | red rose stems in bud on a jet-black surface. | 
2 5 sunrise, producing a bright white light and an orange 
glow over planet Earth's surface, as seen from outer 
space. | one red toy car and one blue toy car on a white 
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surface. | shiny red toy sports car on a white surface. 
| dark blue Beetle-shaped toy car on a white surface. | 
yellow toy Ferrari sports car on a white surface. | 
colourful clown toy car on a white surface. | metal toy 
motorcycle and sidecar on a white surface. | soft toy 
teddy bear, with tartan pouch, on a white surface. 

4-0/192 4-1/192 4-2/175 4-3/85 5-3/50 5-4/50 6-4/75 
7-3/75 7-4/75 7-5/75 7-6/75 8-4/153 9-4/153 

17/img0071 . jpg 17/img0072 . jpg 17/img0073 . jpg 
5/img0062. jpg 6/img0035 . jpg 6/img0036 . jpg 6/img0037 . jpg 
6/img0038. jpg 6/img0039 . jpg 6/img0043 . jpg 

5 

The map data file for the image 6/43 i.e. the focal 
image of Figure 2 is given below: 

brightly coloured party balloons on a white surface. | 
one red toy car and one blue toy car on a white surface - 
| shiny red toy sports car on a white surface, j dark 
blue Beetle-shaped toy car on a white surface. | yellow 
toy Ferrari sports car on a white surface. | colourful 
clown toy car on a white surface. | metal toy motorcycle 
and sidecar on a white surface, | soft toy teddy bear, 
with tartan pouch, on a white surface, | soft toys 
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sitting by and peeping from a wooden toy-box. | teddy 
bear riding an old, red tin toy truck. 

2-1/85 3-1/50 3-2/80 4-2/75 5-1/75 5-2/75 5-3/75 5-4/75 
5 7-0/212 7-1/153 7-2/153 7-3/153 7-4/153 7-5/153 7-6/153 
8-7/300 9-7/151 

16/img0 07 0 . jpg 17/img00 6 9 . jpg 17/img0070 . jpg 
17/img0 071 . jpg 17/img0 07 2 . jpg 17/img0 0 74 . jpg 
10 6.img0035.jpg 6/img0039 . jpg 6/img0043 . jpg 6/img0044 . jpg 

8 

As can be seen above, the captions and the image 
15 IDs' are output as a sequence, the link length data is 
output as lengths indexed in accordance with the 
sequence. The final piece of data from the map data file 
is the indication of the focal image by the use of the 
index to refer to the image ID in the sequence. Thus in 
2 0 the data file for image 6/36 the final data entry 5 
indicates that the fifth position in the sequence 
identifies the focal image. 

It is not essential to use this final piece of data 
to indicate the focal image. Instead the sequence could 
25 be ordered such that the focal image is always the first 
in the sequence. 
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Although the generation of the similarity matrices 
and the map data files have been described as two 
separate processes wherein once the matrices have been 
formed , the map data files are formed, in practice, in 
5 order to reduce storage, the map data file calculation is 
carried out immediately following the generation of the 
similarity matrix for a selected caption. Thus the 
similarity matrices are not stored and are transient: 
they are merely used as an intermediary to form the map 

10 data files. 

The resulting map data files are stored in the map 
data database for use in the rendering of the map in the 
map window 107. Map data files contain all of the 
information necessary for the rendering process. The 

15 rendering process merely needs to retrieve the image data 
identified in the map data file. 

The rendering process for the formation of the map 
data in the map window 107 will now be described in 
detail with reference to Figures 8 and 9. 

20 In step S40 the map data is input having a map ID 

for the best matching caption or having the map ID of the 
selected thumbnail. The thumbnail of the focal image is 
retrieved and rendered at the centre of the map window in 
step S41. In step S42 the thumbnails of the other images 

25 indexed in the map data file are rendered at 
predetermined positions around the focal image. These 
predetermined positions can be at equally spaced 
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positions around the focal image or at random positions. 
The starting positions are not particularly important. 

In step S43 a counter i is set to zero and in step 
S44 the counter is incremented. In step S45 new 
5 thumbnail positions are calculated and the thumbnails are 
rendered in the new positions. Following this in step 
S46 the average thumbnail position (termed curr.ave) is 
determined. In step S47 it is then determined whether 
the counter i has reached 10. If not, the process returns 
10 to step S44. 

Thus steps S44 to S47 are repeated ten times. In 
other words, the new thumbnail positions are calculated 
and the thumbnails are rendered at those positions ten 
times and each time the average thumbnail position is 
15 determined. 

In step S4 8 a ten point moving average of the 
average thumbnail position (termed mov.ave) is 
determined. 

In step S49 it is determined whether the current 
20 average position is within a range of the ten point 
moving average and if the time taken is greater than 
5 seconds. If so the rendering process has completed and 
the process stops in step S50. If not in step S51 it is 
determined whether the time T is greater than 15 seconds. 
25 If so the process terminates in step S50 and if not the 
process returns to step S43. Thus, in step S49 it is 
determined whether there is still significant movement of 



the images or whether they have converged towards a 
solution. The process is allowed at least 5 seconds to 
iterate but no more than -15 seconds. 

Figure 9 illustrates in more detail the process of 
calculating the new thumbnail positions i.e. step S45 of 
Figure 8 . 

In step S60 a first link is selected i.e. the 
separation of two images corresponding to two images 
which have a separation value in the map data file. The 
link length is calculated by measuring the x and y 
distances between the nodes (images). This is the actual 
length of the link (ACTUAL„LEN) . The equation used is: 

ACTUAL — LEN~\J (XDIST) 2 + ( YDIST) 2 

where XDIST is the distance between the nodes along 
the X axis and YDIST is the distance between the nodes 
along the Y axis. 

In step S62 the proportional difference between the 
actual length and the target length, i.e. the length in 
the map data file is then calculated using: 

AC T UAL — LEN- TARGET— LEN 

PDIFF- ■ — 

ACTUAL— LEN 



where PDIFF is the proportional difference 
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ACTUAL_LEN is the actual length calculated and 
TARGE T__LEN is the target length. 

In step S63 5X and 5Y are calculated from: 



bX-XDIST x PDIFF 
bY^YDIST x PDIFF 

5 

In step S64 5X and 5Y are added to the "from" node 
i.e. the node identified by the first index in the map 
data files and in step S65 5X and 5Y are subtracted from 
the "to" node. For example, in the map data file for 

10 image 6/3 6 i.e. the map data file for Figure 1, the first 
link is given in the format 4-0/192. This identifies the 
link as being from the fourth image in the sequence to 
the zero image in the sequence and the link length is 
192. (In the map data file, the 10 images are indexed 

15 from 0 to 9). Thus step S64 requires the addition of 5X 
and 5Y to the position of the fourth image in the caption 
sequence i.e. 6/35 and the substraction of 5X and 5Y from 
the zero image in the sequence i.e. image 17/71. This 
causes the nodes to then move towards the required 

20 length. Because the value of 5X and 5Y can be negative, 
although step S64 is addition and step S65 is 
substraction, this does not result in the nodes moving 
closer together but can result in the nodes moving 
further apart. 



In step S6 6 it is then determined whether there are 
any more links to be processed and if so, in step S68 the 
next link is selected and the process returns to step 
S61. If all the links have been processed, in step S67 
the thumbnails are rendered at their new positions. Thus 
the process of Figure 9 comprises a single iteration in 
the rendering process. 

This embodiment of the present invention thus 
provides a simple method allowing a user to enter a query 
and to be presented with a map of thumbnail images in 
which the focal image comprises the best match to the 
query The surrounding images comprise images which are 
the best matches in the database to the focal image. 
This enables a user to navigate through the database by 
selecting an image as a new focal image. Such a 
technique is fast because there is no need to perform on- 
line matching. There is no query input and instead, the 
ID of the select image is used to retrieve the map data 
for the generation of the next map, 

SECOND EMBODIMENT 

A second embodiment to the present invention will 
now be described with reference to Figures 10 to 12. 

Figure 10 is a schematic diagram of the second 
embodiment of the present invention - 

Many of the functional components of the second 
embodiment are similar to the functional components of 
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the first embodiment as illustrated in Figure 3 and thus 
the same reference numerals are used. In this embodiment 
the interface 1 comprises* the pointing device 2 the query 
input device 4 and the display device 3. When a query is 
5 input, the on-line similarity calculator 9 not only 
determines the best match between the input query and the 
meta data in the meta data database 6, it also calculates 
the N best matches between the input query and the data 
in the meta data database 6. This is used to calculate 

10 similarity values between the input query and the N best 
matches. The similarity data is then passed to an on- 
line map data calculator 13 for calculating map data on- 
line from the similarity values. The map data calculator 
13 refers to the meta data database 6 and the data in the 

15 database 8 in order to form the map data. This data is 
then passed to the rendering processor 10 in order to 
render a map in the map window 107 in a similar manner to 
that shown in Figure 1 . 

In this embodiment, the off-line similarity 

20 calculator 11 and the off-line map data calculator 12 
operates as in the first embodiment to generate map data 
for the map data database 7. This data is used whenever 
the pointing device 2 is used to select an image. In 
other words when an image is selected there is no need to 

25 perform any matching operation on-line and it is possible 
to retrieve the map data simply using the ID of the 
selected image. However, when a user inputs a query, the 
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map data generated is not based on similarity to an entry 
in the database, but instead is based on similarity to 
the query. 

The operation of the second embodiment of the 
5 present invention will now be described with reference to 
Figure 11 which is a flow diagram illustrating the 
over al 1 operation . 

In step S70, captions for the image database are 
created and in step S71 they are stored in the database. 

10 In step S72 similarity matrices are generated in 
accordance with the flow diagram of Figure 6 and in step 
S73 off-line map data is generated in accordance with the 
flow diagram of Figure 7 as in the previous embodiment. 
In step S74 the map data is stored in the database. 

15 Steps S72 to S74 comprise off-line processing and are the 
same as in the first embodiment. 

When a query is input in step S75, in step S7 6 the 
N best matching captions are found and in step S77 a 
similarity matrix is generated on line. This comprises 

20 the determination of the similarity between the N best 
matches to the input query. In step S78 map data is 
generated from the similarity matrix. The map data is 
generated on-line. This will be described in more detail 
with reference to Figure 12 hereinafter. The map data is 

25 then rendered in step S79 and the process then awaits a 
mouse input in step S80. Steps S81 to step S88 
correspond to steps Sll to S18 in Figure 5 and the 
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process operates in the same way as in the first 
embodiment for mouse inputs- A description of steps S81 
to S88 is thus omitted for the sake of brevity. Thus the 
second embodiment only differs from the first embodiment 
5 in that steps S7 to S9 are replaced with steps S76 to 
S79. 

The method of generating map data will now be 
described with reference to Figure 12. In step S90 the 
captions for each image ID in the similarity matrix are 

10 output as a matrix caption sequence indexed by map ID. 
In step S91 the mean, standard deviation , maximum and 
minimum of the values in the matrix are then calculated. 
In step S92 significant values in the matrix are then 
identified either by the fact that they are indexed by 

15 the image ID of the best match, or by the fact that they 
differ by more than n standard deviations from the mean. 
These values are then normalised suitably for display as 
lengths (separations) in step S93. The normalised values 
are then output indexed by caption sequence to the map 

20 data file in step S94 and in step S95 the image ID in 
caption sequence order is output to the map data file. 
Finally, the caption sequence index number of the image 
ID for the best match is output as the identification of 
the focal image in step S96. 

25 This embodiment of the present invention allows a 

user to enter a query and observe a map of images which 
most closely match the query. The process does however 
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require the determination of a similarity matrix and map 
data on-line- Once the map is rendered a user is able to 
navigate the image database in the same manner as for the 
first embodiment. 

5 

MODIFICATIONS TO THE EMBODIMENTS 

The present invention has been described hereinabove 
with reference to two specific embodiments, the present 
invention is not limited to these embodiments and 

10 modifications can be made within the spirit and scope of 
the present invention. 

In particular, the present invention is not limited 
to the searching of image data. The present invention is 
applicable to any form of data such as text data, audio 

15 data, image data, and video data. When video data is 
searched, the rendered map can include images 
representative of a frame with a video. Alternatively, 
a specific thumbnail image of a frame can be provided as 
met a data to the video data. Similarly, for audio data, 

2 0 meta data can be provided which can comprise text. For 
text data, the thumbnail images can comprise an image of 
a document, an abstract, or a caption. Thus the present 
invention is applicable to any form of data wherein the 
rendering process generates a map having nodes which 

2 5 comprise representations for the data. 

When a node in the map is selected, the information 
outputting process will depend upon the form of the data. 
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For example, in the above embodiments, for image data, 
when an image is selected, the size of the thumbnail is 
increased. This can also take place for the 

representation of data in other forms* Alternatively, for 
5 audio data and for video data, at least some of the audio 
or video can be played back. 

Although in both embodiments, the searching is 
performed using meta data on the form of captions for 
images, the present invention is applicable to searching 

10 of any form of meta data or the data itself. For 
example, where the database contains text data to be 
searched, the query can be matched to the data itself, 
e.g. the title, the abstract or the whole text, or to 
meta data such as a document title, file name, abstract 

15 or caption which is extraneous to the data itself. 

In the embodiments, natural language queries or 
keyword queries have been used, the present invention is 
not limited to such queries . Any form of query can be 
used for searching the database. For example, where 

20 colour images are being searched, colour data or 
signatures can be used as an input query - 

In the embodiments described, the similarity measure 
used for determining the similarity between each of the 
images is based on a natural language matching process 

25 between the natural language query and the caption or 
between the captions themselves. The present invention is 
not however, limited to any particular form of similarity 
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measure- For example, the similarities between images 
can be based on the colour content of the images. 
Similarity between videos i.e. films can be based on the 
content of the film e.g. the genre of the film. 
5 In the embodiments the matching process to form the 

similarity measure is described as using the natural 
language matching procedure described in UK Patent 
Application 9904662 and UK Patent Application No. 9821969 
the content of which are hereby incorporated by 

10 reference . The invention can use any form of matching 
technique such as the cosine measure in order to 
determine how well things match. Using a natural language 
or keyword query together with captions or text in 
documents, the cosine measure can be used by considering 

15 each word in the query to comprise a dimension. Thus the 
query comprises a N-dimensional vector having values in 
each of the dimensions. The caption is mapped onto the 
N-dimension space and if the caption includes any of the 
words it is assigned a unit value in the particular 

2 0 dimension. Where no word is present it is assigned a zero 
value in a particular dimension. In this way, the caption 
and the query can be considered as N-dimensional vectors 
and a cosine measure can be obtained as a measure of the 
similarity between the query and the caption. 

2 5 In the embodiments, the links are shown as simple 

links between nodes in the map. The links can however be 
considered to be two different types of links: 



1. Links for which the similarity is high wherein the 
nodes are attracted, and 

2. Links for which the similarity is low wherein nodes 
which are repelled. 

Links which have values close to the mean are not 
mapped. Thus the two different types of link can be 
shown in different colours to show links which are 
pushing and pulling nodes . Further, where additional 
information is available regarding the similarity between 
the images , this can be used to identify the types of 
links between the images e.g. for natural language 
queries and natural language matching , the context of the 
match can be used to enhance the information available 
for the link. Such context information can be made 
available from implementing the technique in UK Patent 
Application No. 9904662. 

The invention has many applications and can be used 
for searching for and browsing through information and 
databases. The present invention is not limited to the 
retrieval of data and can be used for browsing for 
information even for placing requests for information. 
For example, the present invention can be applied to on- 
line shopping for searching for and ordering items, and 
for video-on-demand systems. In such systems the 
thumbnail images can comprise images of the items that a 
user wishes to order. The present invention is also 
applicable to web search engines where the images 
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displayed in the map can comprise images of the web 
pages. For natural language queries to allow the 
searching on such web pages , the captions can either be 
matched to the text content of the web page, or met a data 
for the web pages can be provided e.g. captions. 

The embodiments of the present invention have been 
described with reference to implementation over a network 
using Java applets. This implementation of the present 
invention is particularly convenient since it allows the 
invention to be implemented using a browser either 
locally on a machine wherein the client and server 
effectively comprises a single machine and the searching 
takes place of a database stored locally on a single 
machine, or the technique can be applied to allow 
searching of databases over a network. The present 
invention is not however limited to the specific 
implementation and it can be implemented either using 
specifically designed software or hardware. Since the 
present invention can be implemented in software, the 
present invention can be embodied as a storage media 
carrying computer code for controlling a processor to 
carry out the method. Also, because the computer program 
can be downloaded over a network such as the internet or 
a local area network, the present invention can be 
embodied as a signal carrying instructions for 
controlling a processor to carry out the method. 



The invention defined in the appended claims can 
comprise any combination of the claims and features in 
the claims. Also, although various means for performing 
functions are claimed, the means can be provided in 
combinations . 



WE CLAIM: 



I., A database interface for interfacing to a database 



text data having a descriptive text caption associated 
therewith, the interface comprising: 

receiving means for receiving an input textual 
query; 

comparing means for comparing said input query with 
said captions for said sets of non-text data to determine 
the similarities between said input query and said 
captions and between said captions; and 

display control means for controlling display means 
to display representations of a plurality of sets of non- 
text data which have captions which are the most similar 
to said input query , and for controlling said display 
means to display said representations separated in 
accordance with the similarities between their respective 
captions . 

2. A database interface according to claim 1 wherein 
each said descriptive text caption is text in a natural 
language, said input means is adapted to receive a 
natural language query, and said comparing means is 
adapted to perform a natural language comparison between 
said input query and said captions. 




non-text data, each set of non- 
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3. A database interface according to claim 2 wherein 
said comparing means is adapted to compare the meaning of 
said input query with the meaning of said caption, 

5 4. A database interface according to claim 1, wherein 
said display control means is adapted to select the 
representation of the set of non-text data having the 
most similar said caption to said query as a focal 
representation, and to control said display means to 
10 display said focal representation at a central position 
and to display the representations of the rest of said 
plurality of sets of non-text data around said focal 
representation at distances dependent upon the determined 
similarities - 

15 

5. A database interface according to claim 1, wherein 
said non-text data comprises image data and said display 
control means is adapted to control said display means to 
display thumbnail images of said sets of image data as 

2 0 said representations. 

6. A database interface according to claim 1 , including 
means for receiving a selection signal following the 
selection of a representation as a new query, said 

25 display control means being responsible to said selection 
signal to control said display means to display the 
selected representation at a central position and to 
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display the representations for the sets of non-text data 
which have captions most similar to the caption of the 
selected representation around the selected 
representation at distances in accordance with the 
5 similarities. 

7. A database interface according to claim 6 wherein 
said database includes a set of similarity measures 
stored for each set of non-text data, and said display 

10 control means is adapted to receive said sets of 
similarity measures from said database for a selected 
representation . 

8. database interface apparatus for interfacing a 
15 4oser interface to a database of a plurality of sets of 

non-text data, each set of non-text data having a 
descriptive text caption associated therewith, the 
apparatus comprising : 

means for receiving an input textual query from said 

20 user interface; 

means for comparing said input textual query with 
said captions for said sets of non-text data to determine 
the similarities between said input query and said 
captions and between said captions; and 

2 5 means for outputting information indentifying a 

plurality of sets of non-text data having captions most 
similar to said query to said user interface for the 
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display of representations of said sets of non-text data 
separated in accordance with the similarities. 

9. A user interface for use with said database 
5 interface apparatus of claim 8 and for displaying a 
plurality of representations of sets of non-text data, 
the user interface comprising: 

input means for inputting a text query; 

means for sending said input query to said database 
10 interface apparatus; 

means for receiving information identifying a 
plurality of sets of non-text data having the most 
similar captions to said query, and similarity values; 

means for retrieving said sets of non-text data from 
15 said database; and 

display means for displaying representations of said 
sets of non-text data separated in accordance with the 
similarity values. 

20 KkT A database interface for interfacing to a database 
^ of a plurality of sets of non-text data, each set of non- 
text data having a descriptive text caption associated 
therewith, the interface comprising: 

receiving means for receiving an input textual 
25 query; 

comparing means for comparing said input query with 
said captions for said sets of non-text data to determine 
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the similarities between said input query and said 
captions ; 

selecting means for selecting the determined most 
similar set of non-text data; and 

display control means for controlling display means 
to display a representation of the selected set of non- 
text data and representations of a plurality of sets of 
non-text data having the most similar captions to the 
caption of said selected set of non-text data separated 
in accordance with the similarities between their 
respective captions. 

11. A database interface according to claim 10 wherein 
said database stores the similarities between the caption 
for each set of non-text data and the captions of a 
plurality of the sets of non-text data which are most 
similar and identities said plurality of sets of non-text 
data, said display control means being adapted to 
retrieve said selected set of non-text data and said 
similarities and identities for said selected set of non- 
text data, and to retrieve said plurality of sets of non- 
text data in accordance with said retrieved identities . 

12. A database interface according to claim 10, wherein 
each said descriptive text caption is text in a natural 
language, said input means is adapted to receive a 
natural language query, and said comparing means is 
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adapted to perform a natural language comparison between 
said input query and said captions. 

13. A database interface according to claim 12, wherein 
5 said comparing means is adapted to compare the meaning of 

said input query with the meaning of said caption. 

14. A database interface according to claim 10, wherein 
said display control means is adapted to control said 

10 display means to display said representation of said 
selected set of non-text data as a focal representation 
at a central position and to display the representations 
of said plurality of sets of non-text data around said 
focal representation , 

15 

15. A database interface according to claim 10, wherein 
said non-text data comprises image data and said display 
control means is adapted to control said display means to 
display thumbnail images of said sets of image data as 

2 0 said representations. 

16. A database interface according to any one of claims 
10 to 15 including means for receiving a selection signal 
following the selection of a representation as a new 

25 query, said display control means being responsive to 
said selection signal to control said display means to 
display the selected representation at a central position 



and to display the representations for the sets of non- 
text data which have captions most similar to the caption 
of the selected representation around the selected 
representation at distances in accordance with the 
similarities . 

17. A database interface according to claim 16 wherein 
said database includes a set of similarity measures 
stored for each set of non-text data, and said display 
control means is adapted to receive said set of 
similarity measures from said database for a selected 
representation . 

18. A database interface apparatus for interfacing a 
user interface to a database of a plurality of sets of 
non-text data, each set of non-text data having 
associated therewith, a descriptive text caption and 
information identifying sets of non-text data having the 
most similar captions and giving the similarities; 

the apparatus comprising: 

means for receiving an input textual query from said 
user interface; 

means for comparing said input query with said 
captions for said sets of non-text data to determine the 
similarities therebetween ; 



means for outputting information identifying a set 
of non-text data having a caption most similar to said 
query to said user interface; and 

means causing said information for said set of non- 
text data to be sent from said database to said user 
interface for the display of representations of the 
identified sets of non-text data separated in accordance 
with the similarities. 

19. A user interface for use with said database 
interface apparatus of claim 18 and for displaying a 
plurality of representations of sets of non-text data, 
the user interface comprising: 

input means for inputting a text query; 

means for sending said input query to said database 
interface apparatus ; 

means for receiving information identifying sets of 
non-text data and comparative similarity values; 

means for retrieving said sets of non-text data from 
said database; and 

display means for displaying representations of said 
sets of non-text data separated in accordance with the 
similarity values. 

20. A database interface method for interfacing to a 
database of a plurality of sets of non-text data, each 



set of non-text data having a descriptive text caption 
associated therewith , the method comprising: 
receiving an input textual query; 

comparing said input query with said captions for 
said sets of non-text data to determine the similarities 
between said input query and said captions, and between 
said captions; and 

control display means to display representations of 
a plurality of sets of non-text data which have captions 
which are the most similar to said input query separated 
in accordance with the similarities between their 
respective captions . 

21. A database interface method according to claim 20, 
wherein each said descriptive text caption is text in a 
natural language, said query is input as a natural 
language query, and the comparing step performs a natural 
language comparison between said input query and said 
captions . 

22. A database interface method according to claim 21, 
wherein said comparing step compares the meaning of said 
input query with the meaning of said captions. 

23. A database interface method according to claim 20, 
including selecting the representation of the set of non- 
text data having the most similar said caption to said 



query as a focal representation, and controlling said 
display means to display said focal representation at a 
central position and to display the representations of 
the rest of said plurality of sets of non-text data 
around said focal representation at distances dependent 
upon the determined similarities. 

24. A database interface method according to claim 20, 
wherein said non-text data comprises image data, and said 
display means is controlled to display thumbnail images 
of said sets of image data as said representations. 

25. A database interface method according to claim 20, 
including receiving a selection signal following the 
selection of a representation as a new query, responding 
to said selection signal by controlling said display 
means to display the selected representation at a central 
position and to display representations of the non-text 
data which have captions most similar to the caption of 
the selected representation around the selected 
representation at distances in accordance with the 
similarities . 

26. A database interface method according to claim 25, 
wherein said database includes a set of similarity 
measures stored for each set of non-text data, and said 



set of similarity measures is received from said database 
for a selected representation. 

27. A database interface method for interfacing a user 
interface to a database of a plurality of sets of non- 
text data, each set of non-text data having a descriptive 
text caption associated therewith, the method comprising: 

receiving an input text query from said user 
interface ; 

comparing said input text query with said captions 
for said sets of non-text data to determine the 
similarities between said input query and the said 
captions, and between said captions; and 

outputting information identifying a plurality of 
sets of non-text data having captions most similar to 
said query to said user interface for the display of 
representations of said sets of non-text data separated 
in accordance with the similarities. 

28. A method of operating a user interface for use with 
database interface apparatus operating in accordance with 
the database interface method of claim 27 and for 
displaying a plurality of representations of sets of non- 
text data, the method comprising: 

receiving a text query; 

sending said query to said database interface 
apparatus ; 
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receiving information identifying a plurality of 
sets of non-text data having the most similar captions to 
said query and similarity values; 

retrieving said sets of non-text data from said 
5 database ; and 

displaying representations of said sets of non-text 
data separated in accordance with the similarity values. 

29. A database interface method of interfacing to a 
10 database of a plurality of sets of non-text data, each 

set of non-text data having a descriptive text caption 
associated therewith, the method comprising: 
receiving an input text query; 

comparing said input query with said captions for 
15 said sets of non-text data to determine the similarities 
between said input query and said captions; 

selecting the determined most similar set of non- 
text data; and 

controlling display means to display a 
20 representation of the selected set of non-text data and 
representations of a plurality of sets of non-text data 
having the most similar captions to the caption of said 
selected set of non-text data separated in accordance 
with the similarities between their respective captions. 

25 

30. A database interface method according to claim 29, 
wherein said database stores the similarities between the 



caption for each set of non-text data and the captions of 
a plurality of the sets of non-text data which are most 
similar and identities of said plurality of sets of non- 
text data, the method including retrieving said selected 
set of non-text data, and said similarities and said 
identities for said selected set of non-text data, and 
retrieving said plurality of sets of non-text data in 
accordance with said retrieved identities. 

31. A database interface method according to claim 30 
wherein each said descriptive text caption is text in a 
natural language , said query is received as a natural 
language query, and the comparing step performs a natural 
language comparison between said input query and said 
captions . 

32. A database interface method according to claim 31, 
wherein the comparing step compares the meaning of said 
input query with the meaning of said captions. 

33. A database interface method according to claim 29, 
wherein said display means is controlled to display said 
representation of said selected set of non-text data as 
a focal representation at a central position and to 
display the representations of said plurality of sets of 
non-text data around said focal representation. 



34. A database interface method according to claim 29, 
wherein said non-text data comprises image data and said 
display means is controlled to display thumbnail images 
of said sets of image data as said representations. 

35. A database interface method according to claim 29, 
including receiving a selection signal following the 
selection of a representation as a new query, responding 
to said selection signal to control said display means to 
display the selected representation at the central 
position and to display the representations for the 
normal text data which have captions most similar to the 
caption of the selected representation around the 
selected representation at distances in accordance with 
the similarities. 

36. A database interface method according to claim 35, 
wherein said database includes a set of similarity 
measures stored for each set of non-text data, the method 
including receiving said set of similarity measures from 
said database for a selected representation. 

37. A database interface method of interfacing a user 
interface to a database of a plurality of sets of non- 
text data, each set of non-text data having associated 
therewith a descriptive text caption, and information 
identifying sets of non-text data having the most similar 
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captions and giving the similarities; the method 
comprising: 

receiving an input text query from said user 
interface ; 

comparing said input query with said captions for 
said sets of non-text data to determine the similarities 
therebetween ; and 

outputting information identifying a set of non-text 
data having a caption most similar to said query to said 
user interface; and 

causing said information for said set of non-text 
data to be sent from said database to said user interface 
for the display of representations of the identified sets 
of non-text data separated in accordance with the 
similarities . 

38. A method of providing a user interface for use with 
database interface apparatus operating in accordance with 
the method of claim 37 and for displaying a plurality of 
representations of sets of non-text data, the method 
comprising: 

receiving a text query; 

sending said query to said database interface 
apparatus; 

receiving information identifying sets of non-text 
data and comparative similarity values; 

retrieving said sets of non-text data from said 
database; and 



displaying representations of said sets of non-text 
data separated in accordance with the similarity values. 

39. Data accessing apparatus for accessing each set of 
data having similarity data giving the similarity of the 
data, other sets of data, and the identity of the other 
sets of data, the apparatus comprising; 

receiving means for receiving a query; 
comparing means for comparing the query with said 
sets of data or meta data for said sets of data to 
determine at least the most similar set of data; 

display control means for controlling display means 
to display a representation for the most similar set of 
data and representations for other similar sets of data 
arranged in accordance with their similarity; 

selection means for allowing a selection of 
representation ; and 

retrieval means for retrieving said similarity data 
for the set of data corresponding to selected 
representation; 

wherein said display control means is adapted to 
control said display means to display the selected 
representation and representations for the other sets of 
data identified in said similarity data arranged in 
accordance with their similarities. 
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40. Data accessing apparatus according to claim 39 
wherein said comparing means is adapted to determine the 
most similar set of data; said retrieval means is adapted 
to retrieve said similarity data for said most similar 
set of data; and said display control means is adapted to 
control said display means to display a representation of 
said most similar set of data and representations of 
other sets of data identified in said similarity data 
arranged in accordance with their similarities. 

41* Data accessing apparatus according to claim 39 
wherein said comparing means is adapted to determine a 
plurality of the most similar sets of data to said query 
and their similarity values; and said display control 
means is adapted to control said display means to display 
representations of said most similar sets of data 
arranged in accordance with their similarity values. 

42. Data accessing apparatus according to claim 39, 
wherein said meta data comprises text, said receiving 
means is adapted to receive a text query, and said 
comparing means is adapted to compare the text query with 
text meta data for said sets of data. 

43. Data accessing apparatus according to claim 39, 
wherein said display control means is adapted to control 
said display means to display the representation for the 



most similar set of data centrally with the rest of the 
representations there around. 

44. A data accessing method of accessing sets of data, 
each set of data having similarity data giving the 
similarities of the data to other sets of data and the 
identity of the other sets of data, the method 
comprising: 

receiving a query; 

comparing the query with said sets of data or meta 
data for said sets of data to determine at least the most 
similar set of data; 

controlling display means to display a 
representation of the most similar set of data and 
representations for other similar sets of data arranged 
in accordance with their similarity; 

allowing a selection of a representation; 

retrieving said similarity data for the set of data 
corresponding to the selected representation; and 

controlling said display means to display the 
selected representation and representations for the other 
sets of data identified in said similarity data arranged 
in accordance with their similarities. 

45. A data accessing method according to claim 44, 
wherein the comparing step determines the most similar 
set of data; the retrieval step retrieves said similarity 



data for said most similar set of data; and the display 
means is controlled to display a representation of said 
most similar set of data and representations of other 
sets of data identified in said similarity data arranged 
in accordance with their similarities. 

46. A data accessing method according to claim 44 
wherein the comparing step determines a plurality of the 
most similar sets of data into said query and their 
similarity values; and the display means is controlled to 
display representations of said most similar sets of data 
arranged in accordance with their similarity values. 

47. A data accessing method according to claim 44, 
wherein said meta data comprises text, a text query is 
received, and the text query is compared with text meta 
data for said sets of data. 

48. A data accessing method according to claim 44, 
wherein said display means is controlled to display the 
representation for the most similar set of data centrally 
with the rest of the representations there around. 

49. Data display apparatus for displaying the 
relationships between sets of data, the apparatus 
comprising : 



data receiving means for receiving sets of data and 
similarity values for the similarity between the sets of 
data; and 

display control means for controlling display means 
to display representations for said sets of data 
separated in accordance with said similarity values and 
to display in a first style links between said 
representations which correspond to sets of data having 
a strong similarity value, and in a second style links 
between said representation which correspond to sets of 
data having a weak similarity value. 

50. Data display apparatus according to claim 49 wherein 
said display control means is adapted to control said 
display means to display said representations as images 
of said sets of data. 

51. Data display apparatus according to claim 49 , 
wherein said display control means is adapted to control 
said display means to display a said link in said first 
style if the similarity value associated with the link is 
above the means of the similarity values by a 
predetermined amount and to display a said link in said 
second style if the similarity value associated with the 
link is below the mean of the similarity values by a 
predetermined amount . 
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52. The data display method of displaying the 
relationship between sets of data, the method comprising: 

receiving sets of data and similarity values for the 
similarity between the sets of data; 

controlling display means to display representations 
for said sets of data separated in accordance with said 
similarity values; and 

displaying in a first style links between said 
representations which correspond to sets of data having 
a strong similarity value and in a second style links 
between said representations which correspond to sets of 
data having a weak similarity. 

53. A data display method according to claim 52 wherein 
said display means is controlled to display said 
representations as images of said sets of data. 

54. A data display method according to claim 52, wherein 
said display means is controlled to display a said link 
in said first style if the similarity value associated 
with the link is above the mean of the similarity values 
by a predetermined amount and to display a said link in 
said second style if the similarity value associated with 
the link is below the mean of the similarity values by a 
predetermined amount - 
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55. Data display apparatus for displaying the 
relationships between sets of data, the apparatus 
comprising: 

data receiving means for receiving sets of data and 
5 similarity values for the similarity between the sets of 
data; 

arrangement calculation means for calculating the 
optimum arrangement of representations for said sets of 
data on display means so that said representations are 
10 spaced according to said similarity values; 

display control means for controlling said display 
means to display said representations arranged in 
accordance with said calculation; and 

user selection means allowing a user to select and 
15 move a said representation; 

wherein said arrangement calculation means is 
operable to recalculate the optimum arrangement of said 
representations following the movement of a said 
representation . 

20 

56. Data display apparatus according to claim 55 wherein 
said arrangement calculation means is adapted to carry 
out iterative calculations of the separations of said 
representations from starting separations to target 

2 5 separations . 



57. Data display apparatus according to claim 5 6 wherein 
said display control means is adapted to control said 
display means to display said representations at time 
sequential stages of said iterative calculations. 

58. A data display method of displaying the 
relationships between sets of data, the apparatus 
comprising: 

receiving sets of data and similarity values for the 
similarity between the sets of data; 

calculating the optimum arrangement of 
representations for said sets of data on display meanc so 
that said representations are spaced according to said 
similarity values; 

controlling said display means to display said 
representations arranged in accordance with said 
calculations ; 

allowing the user to select and move a said 
representation ; and 

recalculating the optimum arrangement of said 
representations following the movement of a said 
representation . 

59. A data display method according to claim 58 wherein 
the calculation of the optimum arrangement of the 
representations is carried out as iterative calculations 



of -the separations of said representations from starting 
separations to target separations. 

60. A data display method according to claim 59 wherein 
said display means is controlled to display said 
representations at time sequential stages of said 
iterative calculations. 

61. A data display apparatus for displaying the 
relationships between sets of data, the apparatus 
comprising: 

data receiving means for receiving sets of data and 
similarity values for the similarities between the sets 
of data; 

arrangement calculation means for iteratively 
calculating the optimum separations of displayed 
representations for said sets of data from starting 
separations to target separations corresponding to said 
similarity values; and 

display control means for controlling display means 
to display said representations arranged in accordance 
with said iterative calculations at time sequential 
stages of said iterative calculations. 

62. A data display method of displaying the 
relationships between sets of data, the method 
comprising : 
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receiving sets of data and similarity values for the 
similarities between the sets of data; 

iteratively calculating the optimum separations of 
displayed representations for said sets of data from 
starting separations to target separations corresponding 
to said similarity values; and 

controlling display means to display said 
representations arranged in accordance with said 
iterative calculations at time sequential stages of said 
iterative calculations . 

63. A storage medium storing instructions for 
controlling a processing to carry out the method of any 
one of claims 20, 21, 22, 23, 24, 25, 26, 27, 29, 30, 31, 
32, 33, 34, 35, 36, 37, 38, 44, 45, 46, 47, 48, 52, 53, 
54, 58, 59, 60 and 62. 

64. A signal carrying instructions for controlling a 
processor to carry out the method of any one of claims 
20 to 38, 44 to 48, 52, to 54 and 58 to 62. 

65. A computer program for implementation by a computer 
to carry out the method of any one of claims 20 to 38, 44 
to 48, 52, to 54 and 58 to 62. 

66. A method comprising the combination of any one of 
claims 20 to 38, 44 to 48, 52, to 54 and 58 to 62. 
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67- Apparatus comprising the combination of any one of 
claims 1 to 19, 39 to 43, 49 to 51, 55 to 57 and 61. 
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ABSTRACT 
SEARCH ENGINE USER INTERFACE 

A database interface is provided for interfacing to 
a database having a plurality of sets of non-text data. 
5 Each set of non-text data has a descriptive text caption 
associated with it. An input text query is received and 
compared with the captions to determine the similarities 
between the input query and the captions. A display is 
controlled to display representations of a plurality of 
10 sets of non-text data which have captions which are the 
most similar to the input query and which are separated 
in accordance with similarities* 
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